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Abstract: Empirical processes for stationary, causal sequences are considered. We establish 
empirical central limit theorems for classes of indicators of left half lines, absolutely contin- 
uous functions and piecewise differentiable functions. Sample path properties of empirical 
distribution functions are also discussed. The results are applied to linear processes and 
Markov chains. 

1 Introduction 

The theory of empirical processes plays a fundamental role in statistics and it has many 
applications ranging from parameter estimation to hypothesis testing (van der Vaart and 
Wellner, 1996). The literature of empirical processes for independent random variables is 
huge and there are many deep results; see Donsker (1952), Dudley (1978), Pollard (1984), 
Gine and Zinn (1984), Shorack and Wellner (1986), Ossiander (1987), van der Vaart and 
Wellner (1996). 

To deal with random variables such as time series that are dependent, one naturally asks 
whether results obtained under the independence assumption remain valid. Such asymp- 
totic theory is evidently useful for statistical inference of stochastic processes. Without the 
independence assumption, it is more challenging to develop a weak convergence theory for 
the associated empirical processes. One way out is to impose strong mixing conditions to 
ensure the asymptotic independence; see Billingsley (1968), Gastwirth and Rubin (1975), 
Withers (1975), Mehra and Rao (1975), Doukhan et al (1995), Andrews and Pollard (1994), 
Shao and Yu (1996), Rio (1998, 2000) and Pollard (2002) among others. Other special 
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processes that have been discussed include hnear processes and Gaussian processes; see 
Dehhng and Taqqu (1989) and Csorgo and Mielniczuk (1996) for long and short-range 
dependent subordinated Gaussian processes and Ho and Hsing (1996) and Wu (2003a) 
for long-range dependent linear processes. A collection of recent results is presented in 
Dehling, Mikosch and Sorensen (2002). In that collection Dedecker and Louhichi (2002) 
made an important generalization of Ossiander's (1987) result. 

Here we investigate the empirical central limit problem for dependent random variables 
from another angle that avoids strong mixing conditions. In particular, we apply a mar- 
tingale method and establish a weak convergence theory for stationary, causal processes. 
Our results are comparable with the theory for independent random variables in that the 
imposed moment conditions are optimal or almost optimal. We show that, if the process is 
short-range dependent in a certain sense, then the limiting behavior is similar to that of iid 
random variables in that the limiting distribution is a Gaussian process and the norming 
sequence is \fn. For long-range dependent linear processes, one needs to apply asymptotic 
expansions to obtain y^-norming limit theorems f Section I6.2.2j) . 

The paper is structured as follows. In Section El we introduce some mathematical 
preliminaries necessary for the weak convergence theory and illustrate the essence of our 
approach. Two types of empirical central limit theorems are established. Empirical pro- 
cesses indexed by indicators of left half lines, absolutely continuous functions, and piecewise 
differentiable functions are discussed in Sections El El and El respectively. Applications to 
linear processes and iterated random functions are made in Section El Section [71 presents 
some integral and maximal inequalities that may be of independent interest. Some proofs 
are given in Sections El and El 

2 Preliminaries 

Let (X„)ngZ be a stationary process. Denote by F and Fj^x) = J2i=i '^Xi<x, x G M, 
the marginal and empirical distribution functions. Let ^ be a class of measurable functions 
from R to E. The centered ^-indexed empirical process is given by 

{P^- P)g = - y29iX^) -m^i)] = / 9{x)d[Fr.{x) - F{x)], geQ. (1) 
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The weak convergence theory concerns the hmiting behavior of {{Pn — P)g '■ 9 ^ Q} under 
proper scahng. Our primary goal is to estabhsh abstract Donsker theorems for dependent 
random variables. Namely, we aim at finding appropriate conditions on and Q such 
that {y/n{Pn — P)g, 9 ^ Q} converges in distribution to some tight Gaussian process 
W = {W{g), geG}. 

To describe the weak convergence theory, some mathematical apparatus is needed. Let 
the triple P) be the probability space on which the process is defined; 

let be the set of functions z : Q ^ M. ioi which \\z\\g := sup^gg \^{9)\ < oo. A 

random element ^ with values in is said to be tight if, for every 6 > 0, there 

is a compact set Ks of \\ ■ \\g) for which P(,^ ^ Ks) < S. Assume that for any 

X G M, supggg \9{x) — Pgl < OO- Then ^/n{Pn — P) is a map from Vt to i^{Q). We say 
that y/n{Pn — P) converges weakly to the tight Gaussian process W if, for any bounded, 
continuous function h : ■ \\g) ^ M, lim„^oo lE*{/i[v/^(P„ - P)]} = E[h{W)]. 

Here E* denotes the outer expectation: E*Z = inf{Ef/ : U > Z and U is measurable}. 
The outer probability of an arbitrary set A C is given by P*(y4) = E*(1a). The outer 
expectation is introduced to deal with the measurability issue which emerges when Q is 
uncountable. See van der Vaart and Wellner (1996) for more on weak convergence theory. 

It is well-known that {^/n{Pn — P)g, g ^ Q} converges weakly to some tight Gaussian 
process {W{g), g ^ Q} if and only if the following two conditions are satisfied (Theorem 
1.5.4, van der Vaart and Wellner, 1996): 

(i) Finite- dimensional convergence: For any finite set of functions gi, . . . , gt & G, 

[\^{Pn — P)gi, . . . , \/n{Pn — P)gk] =^ a multivariate normal distribution. (2) 

(ii) Tightness: For every 6,ri > 0, Q can be partitioned into finitely many sets Qi, . . . ,Qp 
such that 



lim sup ] 



max sup I ^/n{Pn — P){g — h)\ > 5 
i<i<Pg,hegi 



< V- (3) 



The finite-dimensional convergence (i) is trivial if Xk are iid and E[(7^(Xi)] < oo. 
For dependent random variables, the central limit theorem (0) itself is an interesting and 
important problem and it has received much attention for more than a half century. Various 
strong mixing assumptions are imposed in early work [cf. Rosenblatt (1956), Ibragimov 
(1962), Bradley (2002), Doukhan (1994) and Peligrad (1996) among others] . Gordin (1969) 
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and Gordin and Lifsic (1978) proposed the martingale approximation method which does 
not require strong mixing conditions. This method is quite powerful in establishing central 
and non-central limit theorems; see Wu (2003a, 2003b, 2004a, 2004b), Wu and Woodroofe 
(2004), Hsing and Wu (2004) and Wu and Shao (2004), Gordin and Holzmann (2004) for 
some recent developments. Earlier contributions can be found in Hall and Heyde (1980), 
Kipnis and Varadhan (1986), Volny (1993) and Ho and Hsing (1997). 

Generally speaking, it is more challenging to verify the tightness condition (ii). Consider 
the special case of the indicator function class Q = {l.<s : s G M}. Let Rn{s) = ^/n[Fn{s) — 
F{s)]. Then Q requires that there exists a partition M = Uf^j^Tj such that 



lim sup 1 



max sup \Rn{s) — Rn(t)\ > 6 

l<i<Ps,KTi 



< T]. 



The verification of the preceding relation is not simple partly due to the fact that F„ is 
discontinuous. For iid random variables, there exist sophisticated tools such as chaining 
arguments and exponential inequalities. It is not straightforward to apply those tools to 
dependent random variables. 

In our problem the major interest is to obtain comparable results without the iid 
assumption. It is necessary to impose certain structural assumptions on the underlying 
process (X„)„g2 and the class Q. In the early literature, strong mixing conditions have 
been imposed on (X„) [for example, Doukhan et al (1995), Pollard (2002)]. Here we restrict 
ourselves to causal processes. Let (£fc)fceZ be independent and identically distributed (iid) 
random variables; let J be a measurable function such that 

) (4) 

is a proper random variable. Such processes are also known as causal Bernoulli shifts 
and they have received considerable attention recently; see Doukhan and Louhichi (1999), 
Prieur (2002), Doukhan (2003) among others. The framework is general enough to 
allow many interesting and important examples. Prominent ones are linear processes and 
Markov chains arising from iterated random functions; see Section [Ul For the class Q, 
we consider indicators of left half lines, absolutely continuous functions and piecewise 
differentiable functions. 

In this article we will apply the martingale method, and thus shed new light on this 
important problem. To illustrate the idea behind our approach, let JF„ = ( 
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and denote by = P(X„+i < the conditional distribution function of Xn+i 

given the sigma algebra cr(JF„). Assume throughout the paper that the conditional den- 
sity fe{x\J-'n) = {d / dx)Fe{x\J-'n) exists almost surely. Define the conditional empirical 
distribution function Fn{x) = Yl^=i ^ei^l^i)- Then 

- F{x) = - + - F{x)]. (5) 

The decomposition (0) has two important and useful properties. First, n[Fn{x) — Fn{x)] 
is a martingale with stationary, ergodic, and bounded martingale differences. Second, the 
function Fn — F is differentiable with derivative fn{x) — f{x), where fn{x) = {d/dx)Fn{x) = 
n~^Yl^=ife{^\^i)- These two properties are useful in estabhshing tightness. Wu and 
Mielniczuk (2002) gave a similar decomposition in the asymptotic theory for kernel density 
estimators of linear processes. 

The following notation will be used throughout the paper. Let w\{du) = (1 + \ u\)^{du) 
be a weighted measure. For a random variable ^ write ^ G L'^, g > 0, if ||^||q := [e(|^|'')]^/'' < 
oo. Write the norm ||^|| = ||^||2. Define projections VkS, = ^{^\^k) - E(^| J'fc-i), k e Z. 
Denote by Cg (resp. C-y, etc) generic positive constants which only depend on q (resp. 
7, yU. etc). The values of those constants may vary from line to line. For a function p(-), let 
pQ = {p{-)g{-) '■ g G Q}. For two sequences of real numbers (a„,) and (6„), write a„, ~ 6„ 
if lim„^oo a„/6„ = 1. 

3 Empirical distribution functions 

In this section we consider sample path properties and weak convergence of empirical 
distribution functions. Recall Rn{s) = y/n[Fn{s) — F(s)]. The classical Donsker theorem 
asserts that, if Xk, G Z, are iid random variables, then {i?„(s), s G M} converges 
in distribution to an F-Brownian bridge process. The result has many applications in 
statistics. To understand the behavior at the two extremes s = ±oo, we need to consider 
the weighted version {Rn{s)W (s) , s G m}, where W{s) — > oo as s — > ±oo. Clearly, if W is 
bounded, then by the continuous mapping theorem, the weak convergence of the weighted 
empirical processes follows from that of Rn- The Chibisov-O'Reilly Theorem concerns 
weighted empirical processes of iid random variables. A detailed account can be found in 
Shorack and Wellner (1986, Section 11.5). The case of dependent random variables has 
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been far less studied. For strong mixing processes see Mehra and Rao (1975) and Shao and 
Yu (1996). Section [3.11 generalizes the Chibisov- O'Reilly Theorem to dependent random 
variables. Section 1221 considers weighted modulus of continuity of Proofs of Theorems 
naiid|21are given in Section|H| Let the weight function W be of the form W{t) = (1 + \t\y 
for some 6 > 0. 



3.1 A weak convergence result 

Let m be a measure on R and Tn{6) = Y17=i ^(^' -^i) —nE[h{6, Ti)], where his a. measurable 
function such that ||/i(^, jFi) || < cxd for almost all 6 (m). Define 



a{h,m) = f]J [ \\Voh{9,J'jWm{de). 



(6) 



Let f^{9\J^k) = {d/dd)fe{d\Tk). In the case that h{9, T^) = fe{e\Tk) or h{9, Tk) = Wl^k), 
we write a{fe,m) or a{f'^,'m) for a{h,m). 

Theorem 1. Let 7 > and q > 2. Assume E(|Xi|''') < 00 and 

E[f^/\u\J^o)]w^-i+g/2{du) < 00. (7) 



In addition assume 
Then (i) 



E 



sup|/2„(s)|2(l + |s|) 



27/9 



.sell 



0{l) (9) 



and (a) the process {-R„(s)(l + |s|)'^/'^, s G M} converges weakly to a tight Gaussian process. 

An important issue in applying Theorem^ is the verification of (jH)), which is basically 
a short-range dependence condition. For many important models such as linear processes 
and Markov chains, (jS)) is easily verifiable; see Section |U1 In particular, if X„ is a linear 
process, then (jSI) reduces to the conventional definition of the short-range dependence of 
linear processes. 
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Remark 1. Let a„ = sup{|P(y4 n B) - f{A)f{B)\ : Ae Aq,B e Bn} be the strong mixing 
coefficients, where Ak = cr(Xj,z < k) and Bn = cr{Xj,j > n). If X„ is strong mixing, 
namely an 0, Rio (2000) showed that E[sup,gK l^n(s)P] < (1 + ^Y2=l "fc)(2 + logn)^; 
see Proposition 7.1 therein. Clearly the bound in Q is sharper. <0 

Corollary 1. Let 7 > 0. Assume that E(|Xi|'''+'^) < 00 and sup„ /e(n|jFo) < r hold for 
some 5 > and r < 00. Further assume that crlfsjUJi^^) < 00 and cr(/^, < 00. 

Then + \s\y^^, s G M} converges weakly to a tight Gaussian process. 

Proof. Let g = 2(7 + 5 + l)/(7 + 1) and 7' = 7g/2. Then 7' - 1 + g/2 = 7 + 5. We shall 
apply Theorem [T] with 7' and q. Since E[/g(M|jFo)] = f{u), ((Tj) follows from 

/ E[f^^\u\J^o)]wy_i+g/2idu) < r''/^-^ / E[/,(M|Jo)]wy„i+g/2(rfM) 

Note that 27'/^ = 7. Then (jH)) holds and the corollary follows from Theorem <0 

It is interesting to compare Corollary ^ with the Chibisov- O'Reilly Theorem, which 
concerns weighted empirical processes of iid random variables. The moment condition 
E(|Xi|'''+'^) < 00 of Corollary [T] is almost necessary in the sense that it cannot be replaced 
by the weaker one 

E{|Xiriog-\2 + |Xi|)[loglog(10+ |Xi|)]-^} < 00 (10) 

for some A > 0. To see this let Xk be iid symmetric random variables with continuous, 
strictly increasing distribution function F; let be the quantile function and m{u) = 
[1 + \F*(u)\]~"'^'^. Then we have the distributional equality 

{Rn{s)il + \s\y/^ sem}=v {Rn{F*{u))/m{u), u G (0, 1)}, 

Assume that F{s){l + |s|)'^ is increasing on (—00,6*) for some G < 0. Then m{u)j^fu 
is decreasing on (0,F(G)). By the Chibisov-O'Reilly Theorem, {Rn{F"^{u)) /m{u),u G 
(0, 1)} is tight if and only if limtj^o tnit) / ^/t log log(t~^) = 00, namely 

lim F(u)(l + |u|)'^loglog|u| = 0. (11) 
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The above condition controls the heaviness of the tail of Xi. Let F{u) = |u|~'*'(loglog 
for u < —10. Then it is easily seen that (fTUI) holds, while (fTT|) is violated. It is unclear 
whether stronger versions of (|Tn|l such as E(|Xi|^) < oo or E[|Xi|'>' log^^(2 + |Xi|)] < oo 
are sufficient. 



3.2 Modulus of continuity 



Theorem 121 below concerns the weighted modulus of continuity of Rn{-)- Sample path 
properties of empirical distribution functions of iid random variables have been extensively 
explored; see for example Csorgo et al. (1986), Shorack and Wellner (1986) and Einmahl 
and Mason (1988) among others. It is far less studied for the dependent case. 

Theorem 2. Assume EdXil"^) < oo and snp^ fs{u\J-'o) < r for some 7 > and r < 00. 
Let 6n < 1/2 be a sequence of positive numbers such that (logn)^'^/*^'^"^^ = 0{nSn), where 
2 < q < A. Further assume that, for some fi < 1, 

Cr(/e, W2y/g^^,) + Cr(/^, tf27/g+M) < OO. (12) 

Then there exists a constant < C < oo, independent of n and 5n, such that for all n>l. 



E 



SUp(l + \t\f^''' sup \Rnit + S)- Rnitf 



(13) 



4 Absolutely continuous functions 

Let AC be the collection of all absolutely continuous functions : M i-^ M. In this section 
we shall investigate the behavior of the empirical process ^/n{Pn — P)g indexed hy g E Q^^^ 
OT g E 'Hri,5, where Q-y^^, 7 > 0, yU < 1, is the weighted Sobolev class 

g^,^=\geAC: / g\u)w^^^^{du) + / [g' {u)fw_^^^{du) < 1 \ (14) 
I Jr Jr ) 

and the class 7^,^,5, ?7, ^ > 0, 77 — 5 < 1, is given by 

= {geAC: \g{u)\ < (1 + |n|)^ \g'{u)\ < (1 + |n|)^ for all u G M}. (15) 

Functions in Q^^^ can be unbounded if 7 + > 1. See Remark |21 for some properties of 
Q^^^ and Tir^^s- In the study of empirical central limit theorems for independent random 
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variables, bracketing conditions are often imposed. The class of differentiable functions is 
an important case that bracketing conditions can be verified; see Chapter 2.7 in van der 
Vaart and Wellner (1996). Functions considered in robust inference are often absolutely 
continuous. 

Theorem 3. (i) Let 7 > and < 1. Assume EdXil"^) < 00 and 

cr(/e,w^+^) < 00. (16) 

Then {^Jn{Pn — P)g : g G ^7,^} converges weakly to a tight Gaussian process {W{g) : g E 
Q-y^ii} with mean and finite covariance function 

cov[W{gi),W{g2)] = ^ cov[^i(Xo), ^2(^fc)], ^1,^2 e ^7,^. 

(a) Let ?7, 5 > 0. Assume that 

00 

J]rVP[j"<l^il<(j + l)"]<oo (17) 
3=1 

if f] — 6 < 1, where a = {1 + 6 — ri)~^ , and that 

00 

^2^Vp(2^' < l^il < 2^'+^) < 00 (18) 

if rj ~ 5 = 1. In addition, let m{dx) = (1 + |a;|)^+^'' log^(2 + |a;|)(ia; and assume 

a(/„m)<oo. (19) 
Then the conclusion in (i) still holds with Q^^^ replaced by Tiri^s- 

The proof of Theorem El is given in Section 14.11 Theorem El generalizes the empirical 
central limit theorems in Gine and Zinn (1986) in two ways: by allowing dependence and 
by considering wider classes. Gine and Zinn (1986) considered iid random variables and 
Lipschitz continuous functions. In particular, they show that (i) if 77 = 5 = (the class 
of bounded Lipschitz continuous functions), then ()17|) is necessary and sufficient for the 
tightness of {^/n{Pn — P)g : g G 7Yo,o} and (ii) if 5 = 0, then (fTH|l is necessary and sufficient 
for the tightness of {^/n{Pn — P)g : g G Ho, }, where TYq,- = {g ■ \g'iu)\ < 1} is the class 
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of Lipschitz continuous functions. Further consideration is given in van der Vaart (1996). 
So in the cases of r/ = 5 = and 6 = 0, rj = 1, the conditions (fT7|) and (fTHj) are optimal. 

The dependence structure of (^fc)fcgZ 'wiU inevitably find its way into the weak conver- 
gence theory. Here the assumption on the dependence is encapsulated as a{h, m) < oo, 
which is a consequence of the requirement of y^-normalization of the partial sums in view 
of ^^\\Tn{0)\\'^m[d9) < na'^ih^m) by LemmalU Conditions (fT^ and (fT^ actually imply 
more. They are natural conditions for the asymptotic normality of ^/n{Pn — P)g for a 
fixed g G Q^^^ or g & 'H-q.s] see the proof of Theorem |21 in Section I^TTl If X„ is a long-range 
dependent linear process, then (fTB|) and (fT^ are violated and the norming sequence of 
{Pn — P)g is different from ^/n. In this case asymptotic expansions are needed to ensure 
a Gaussian limit process with a i/n-normalization (cf. Section Ffi. 2. 2j) . 

Doukhan et al (1995), Rio (1998) and Pollard (2002) considered stationary, absolutely 
regular processes. Bracketing conditions are given based on a metric which involves mixing 
coefficients. Rio (2000, Theorem 8.1) established an empirical central limit theorem for 
strong mixing processes indexed by Lipschitz continuous functions. In the case of causal 
processes, it is not easy to verify that they are absolutely regular or strong mixing. For 
linear processes to be strong mixing, quite restrictive conditions are needed on the decay 
rate of the coefficients (Doukhan, 1994). In comparison. Theorem IHl and Corollary |21 (cf. 
Section 16.2.11) impose a sharp condition on the coefficients of linear processes. 

Some new weak dependence conditions are used in Doukhan and Louhichi (1999), 
Prieur (2002) and Dedecker and Prieur (2003b). Here we compare our results with theirs 
by applying them to the Gaussian process X„ = Yl^o CLi^n-i, where Oj = (i + l)"'^, /5 > 1, 
and En are iid standard normal. As mentioned in the preceding paragraph. Theorem Inland 
Corollary 121 impose the sharp condition (3 > 1. Proposition 2 in Doukhan and Louhichi 
(1999) asserts that i?„ is tight if 

sup |cov[/(XiJ/(XiJ, f{Xt,)f{XtM = 0{n-'/'-') (20) 

for some 6 > 0, where X = {x — ls<x<t '■ s,t G M}, < ti < ^2 ^ ^3 ^ ^4 and 
n = ts — t2- Let ti = ^2 = 0, ts = ^4 = 77, and f{x) = lx<o- Elementary manipulations 
show that the covariance r{n) = E(XoX„) ~ Cp'n}''^^ for some constant < < oo and 
|P(Xo < 0, X„ < 0)-P(Xo < 0)P(X„ < 0)1 = 0[r{n)]. So ^ requires 1 - 2/3 < -5/2-5, 
namely (3 > 7/4. The empirical central limit theorem in Dedecker and Prieur (2003b) is 
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not applicable to the process X„. Actually, Corollary 4 in the latter paper assumes that 
YlkLi'^'W < where (f){k) = sup^g]^ ||P(Xfc < t\J^o) — ¥{Xk < t)||oo- It is easily seen 
that (f){k) = 1 for the process Xk- By Theorem 1 in Prieur (2002), Rn converges weakly 
if E\Xn — ^"=0*^*^"-*! ~ 0{n~'^~'^^~^) for some 6 > 0. The latter condition implies 
(3 > 2.5 + 2v^- On the other hand, the existence of the conditional density /e(x|jF„) is not 
assumed in those papers, while it is needed in our results. 

Remark 2. We say that two classes Qi and Q2 are equivalent, denoted by Qi ^ Q2, if there is 
a constant A such that X^^Qi C Q2 C XQi. In this case the process {y/n{Pn — P)g : g G Qi} 
converges weakly if and only if {y/n{Pn — P)g : g & Q2] does. 

If ?7 > 1 + 5, then Ti^ s ~ ^^5+1,5. Clearly T-Ls+i,5 C Tir^^s- Let -u > 0. Then 



\9iu)\<\giO)\+ \g'{t)\dt<l+ {l+tYdt 
Jo Jo 



< ii + uY+' 



and hence Hr^^s C Hs+i^s- The two classes Q^^^ and Hrj^s are closely related. If 7 > 1 + 7] + 6, 
then Hri^s C C^,r,,5^7,7,-<5 for some C^^r],s < 00. There is no converse inclusion. 
A particularly interesting case is when 7 > and /i = 1. Let 



6;;^,o = {9^ AC: giO) = and / [g'iu)]'w.,+^idu) < 1} 

Jr 



(21) 



and ^-^,^,0 = {ge g^,^ : g{0) = 0}. By Hardy's inequality ^^,1,0 C (1+47-2)1/2^*^^^^. 
The other relation ^* 1 C ^7,1,0 is obvious. Therefore Q^^ifi ~ ^7,1,0- By Lemma El 
for g G G-y,fj., sup^5'2(x)(l + |x|)-''' < C^^^j for some C^,^^ < 00. So g^{0) < C^^^ and 
there exists a constant Ci^^^ such that g — g{0) G C'^^^^Q^^^ whenever g G Q-y,^- Therefore 
6^7,1,0 ~ {^7 - ^?(0) : 9 e 6^7,i}. Notice that (P„ - P)((? - (7(0)) = (P„ - P)(7. So the weak 
convergence problem of {^/n{Pn — P)g : g G ^7,1} is equivalent to the seemingly simpler 
one {V^{Pn -P)g: gE G;^,^,}. 

4.1 Proof of Theorem [HI 

We first consider the case Q = Q-^,^- Following the ideas behind the decomposition (0), 
write ^/n{Pn - P)g = Mn{g) + Nn{g), where 

Mn{g)=n'/^ [ g{x)d[Fn{x) - F^x)] and NM = n'^^ [ g{x)[fn{x) - f{x)]dx. 
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The tightness of the process {y/n{Pn — P)g : g G ^7,^4} follows from that of {Mn{g) : g G 
Q'^,^} and {Nn{g) : g G ^7,^,}, which are asserted by Propositions [T] and |21 (cf. Sections 
14.21 and I4.3|l respectively. It remains to establish the finite-dimensional convergence. By 
Lemma El there exists C^^^ < 00 such that 

g'^ix) [ [ 

xeM (1 + FI)^ Jm. Jr 

Hence < C7,^e[(|Xi| + 1)'^] < 00. Notice that for n > 1, 

Vog{Xn) = VoE[g{Xn)\J^n-l] = Vo / g{x) fe{x\Tn~l)dx = / g{x)Vofe{x\J^n-l)dx. 

JR JR 

By the Cauchy-Schwarz inequality, 

\\VogiXn)\\ < \ g^{x)w^j^f,{dx) / \\Vofe{x\Tn-i)rw^+t,idx), 
V JR 

which in view of (jlfi|l is summable. By Lemma ^ the finite-dimensional convergence holds. 

The other case that Q = Tir^^s similarly follows. For g G \g{u)\ < (1 + \u\)^. By 

()17|) or p8p. we have E(|Xip'') < 00, which by Lemma ^ implies the finite-dimensional 
convergence. The tightness follows from (ii) of Proposition ^ and Proposition |21 <0 



4.2 Tightness of M„ 

In this section we establish the weak convergence of M„. The tightness of M„ involves 
moment conditions on Xi and sizes of Q^^^ or Tirj^s, which are characterized by the param- 
eters (7, /i) and (?7, S). There is a tradeoff between the moment conditions and the sizes of 
the classes: larger classes require stronger moment conditions. 

Interestingly, the tightness of M„ does not involve the dependence structure of {Xk)k^z- 
In Section Ffi. 2. 2^ we apply the result to strongly dependent processes. 

Proposition 1. (i) Assume that EdXil''') < 00. Then the process {Mn,{g) '■ g G Q-y^fi} is 
tight. Consequently, it converges weakly to a tight Gaussian process {WM{g) ■ g G ^7,^} 
with mean and covariance function 

coY[WMigi),WM{g2)]=H[Pogi{X,)][Vog2{X^)]}, gug2 e G^,^. (22) 
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(a) Letr],5 > 0. Assume ifr]-5 < 1 and HT^) ifr]-5 = 1. Then {Mn{g) : g E Hr^^s} 
converges weakly to a centered, tight Gaussian process {WM{g) '■ g E Hri^s} with the 
covariance function ^2^) . 

The proof of Proposition ^ is given in Section El Dedecker and Louhichi (2002) gen- 
eralized Ossiander's (1987) method to dependent sequences. They apply their results 
to martingales (cf. Theorem 3.3 and Section 4.2 therein). A related result is given in 
Nishiyama (2000). The conditions imposed in the latter paper seem hard to work with; 
see van der Vaart and Wellner (1996, p. 212). Here we compare our Proposition ^ with 
that of Dedecker and Louhichi (2002). 

For a class Q and a norm d let N{5, Q, d) be the minimum number of 5-brackets needed 
to cover Q. Here for two functions / and m, the bracket is defined by = {g E Q '■ 
K^) — fi'(^) — u{x) for all x}. If d{u — I) < S, then we say that [l,u] is a 5-bracket. For a 
function g define its essential supremum ^2 norm by 



Dedecker and Louhichi (2002) show that the £°°-valued map \/n{Pn — P) is asymptotically 
(i2-equicontinuous if 



However, the norm d2 in (j^!^ is so strong that ()24|1 is violated for many important applica- 
tions. For example, let Xn = Xli^o "^here Sk are iid random variables. Assume that 
the support of ei is the whole real line, ao = 1 and that there are infinitely many non-zero 
coefficients. Let Qh = {gi- ^ ^ 1^}) where g{x) = max[— 1, min(x, 1)] is the derivative 
of a Huber function. Elementary calculations show that, since Yq = Xi — ei has the support 
of the whole real line, the bracket number N{6, Qh, (^2) = 00 for all < 5 < 1/2. Neverthe- 
less, their generalization is quite important since the condition ()24j) is tractable in many 
cases. We apply their result in the proof of Proposition ^ and overcome the limitation by 
using a truncation argument. 

Remark 3. Let Qdl be a class of functions such that it satisfies Dedecker and Louhichi's 
bracketing condition (j24p and (^dl;'^2) is totally bounded. Then it is easily seen that the 
pairwise sum Qdl + G■y,^l = {g + h : g E Qdl, h E ^7,^} is also a Donsker class, namely 
Proposition HI holds for this pairwise sum. ^ 



dl{g) = ess supE[/(Xi)| J'o]- 



(23) 




(24) 
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4.3 Tightness of N, 



n 



The tightness of iV„ requires the short-range dependence condition (fTBj) or (fTI?|) . The fact 
that Fn is different iable is quite useful. 

Proposition 2. Assume 17^) (resp. ( TT^) ). T/ien i/ie process {Nn{g) : G ^7,/^} (resp. 
{Nn{g) ■■ g e ^^,6}) is tight. 



Proof. Recall l\.\<rQ^,t, = {5fl|.|<,, : g E ^7,^}. For g E Q, 



\g{x)[fn{x) - f{x)]\dx 



|a:|>r 



< / g {x)w^^^^{dx) \fn{x) - f{x)\ w^+f,{dx) 

J\x\>j- J\x\>r 

< / \fn{x) - f{x)\'^W^+^{dx). 
J\x\>r 



Applying Lemma E] with A = {—00, — r) U (r, 00) and T„(^) = XliLi /^(^l-^j)' "^^ have 



E <( sup 



n I \g{x)[fnix) - f{x)]\dx 
\x\>r 



<nE< I \fnix)- f{x)\'^w^+^{dx) 

'\x\>r 



< 



E 

.j=o 



\x\>r 



which converges to as r — 00. As in the proof of Proposition ^(cf. Section^)), it remains 
to show that for fixed r, {Nn{g) : g G i\-\<rQ'y,tJ.} is tight. To this end, we apply Lemma E] 
with A = R and Tn{0) = Y17=i fei^l^j)- (fTB|) . there exists k, < 00 such that 



n / \\fn{x) - f{x)\\ w.y+f,{dx) < K 



holds for all n G M. So the asymptotic equi- continuity follows from 



lim lim sup P* I sup | A'^^ [g — h)\ > e 

„^oo I \\g-h\\^<S, g,hl^'i-\.\<rGl,^^ 



< lim lim sup P < ^/n / \fn{x) — f{x)\dx > - 



<5^o 



< lim lim sup —nl 

5^0 n-,00 
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\fnix) - f{x)\w^+^{dx) 
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0. 



It is easily seen that (l|-|<r^7,/i,, || ■ ||oo) is totally bounded [cf. (jH7|) and 
the process {Nn{g) : g G Q-y,ij.} is tight. 

The other case that g G Hri^s can be similarly proved by noticing that 



Therefore, 



sup / \g{x)[fnix) - f{x)]\dx < / - 

g&in,s J A J A 

1/2 

\fn{.x) - f{x)\'^m{dx) 



< 



/ (1 + \x\)-^\og-'^{2+\x\)dx 

J A 



1/2 



in view of the Cauchy-Schwarz inequality. 







4.4 Finite-dimensional convergence 

Lemma 1. Let gi, . . . ,gk be measurable functions such that g{Xi) G and E[gi{Xi)] = 0, 
1 < i < k. Let g = {gi, . . . , gk) ■ Assume 

oo 

5^ ||^o^7*(^m)|| < oo, l<l<k. (25) 

m=0 

Then := Em=o ^ofl'i(^m.) G and 

n 

n''\Pn - P)g = n~'/' 5^{g(X,) - E[g(Xi)]} iV(0, S) (26) 

1=1 

where J^ij = E{^gXg^) = ^^^^ cov[^i(Xo), ^^(X/)]. 

Proof. The case in which k = 1 easily follows from Hannan (1973, Theorem 1) and 
Woodroofe (1992). For > 2, we apply the Cramer- Wold device. Let Ai,A2 be two real 
numbers. Using the relation = Em=o ^ofi'i(^m), we have Ai^g^ + = ^Aisi+Azga) 

from which ()26|) easily follows with the stated covariance function. <(> 

There are many other forms of central limit theorems for dependent random variables. 
Lemma Q imposes simple and easily verifiable conditions. In addition, it also provides a 
very natural vehicle for the finite-dimensional convergence of {Mn{g) '■ g G Q-y,ij.}', see the 
proof of Theorem 01 in Section 14.11 

Gordin (1969) obtained a general central limit theorem. However, as pointed out by Hall 
and Heyde (1980, p. 130), the condition imposed in Gordin (1969) is very difficult to check. 
To overcome the difficulty, in their book Hall and Heyde proposed Theorem 5.3 (see p. 133), 
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which states that the central hmit theorem holds provided Yl'^=o^o9{-^n) converges to 
some random variable ^ in with ||^|| > and limn^oo nlKPn — P)g\\'^ W^W^- Due to the 
dependence, the verification of the latter condition could be difficult as well. In LemmaQwe 
do not need to verify the latter condition. Another related result is given in Dedecker and 
Rio (2000). A key condition in the latter paper is that J2'^=o9i-^o)^[9i-^n)\^o] converges 
in L^. Consider the linear process g{Xn) = ^^q*^*^"-*' where = (i + l)"'^, P > 1/2 
and Ei are iid standard normal. Then their convergence condition requires /3 > 3/2. In 
comparison, only needs /3 > 1. 



5 Piecewise differentiable functions 

Using the weak convergence theory for step and absolutely continuous functions, we can 
easily deal with piecewise differentiable functions. Let J > 1 be a fixed integer and Qi^ = 

{ELi^?i(-)i-<9. : gieg^,i, 9ieR}. 

Theorem 4. Let 7 > 0. Assume that E{\Xi\^~^^) < 00 and sup„ fe{u\J^o) < r hold for some 

6 > and t < 00. Further assume that a{f^^wij^^) < 00 and cr(/^,w_i_^) < 00. Then 
{\/n{Pn — P)g : g G Ql^} converges weakly to a Gaussian process. 

Proof. Without loss of generality let / = 1. For all 6* G M, the function ge{-) = 
g{-)l.<0+ g{6)l.y0 G C^^7,i for some constant < 00. By TheoremEl under the proposed 
conditions, {\/n{Pn — P)g : g G C^^-y^i} converges weakly. It then suffices to show that 
{g{0)y/n{Pn-P)l.>e ■■ g G ^^,1, e em] is tight. Recall i?„(s) = y/n[Fn{s) - F{s)]. Notice 
that (P„ - P)l.>e = -{Pn - P)1.<0. By LemmaEl g'^{e) < C^{1 + |^|)^. Hence 

{g{e)^{P^ - P)l.>, : g G G^^i, ^ G K} C {Ai?„(s) : < C^{1 + \s\)\ s G E}. 

The latter is a process indexed by both A and s. By (I76j) and (jZH)), 

lim limsupE<^ sup sup [A^i?^(s)] I = 0. (27) 
Let r,, = {(A, s) : A^ < C^(l + \s\y, \s\ < r} and T = T^. For (Ai, si), (A2, S2) G F^, 

\XlRn{Si) - X2Rn{s2)\ < | Ai - A2 1 1 -R„ (Si ) | + | A2 1 1 -Rn(si) - i?„ (^2) | 

< |Ai - A2I sup \Rn{u) \ + Cr,^\Rn{Si) - Rn{s2)\- 
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Since II sup„gR |i?n(M)||| = C(l) and -R„,(-) is tight, it is easily seen that {\Rn(s) : (A, s) G 
Tr} is also tight. By (j^Zj), the process {Ai?„(s) : (A, s) G F} is tight. Notice that 

\\rog{X^)lx,.<e\\ < \\Voge{Xn)\\ + \gm\\'Polx„<e\\ 
is summable, the finite-dimensional convergence follows from Lemma H <0 



6 Applications 

Recall (jni) for the definition of a{h,m). To apply Theorems ^ |21 and El one needs to 
verify the short-range dependence condition that a{h,m) is finite. In many important 
applications including Markov chains and linear processes, there is an cr(jF„)-measurable 
random variable y„ such that 



(2^ 



Write r„ = /(... , En-i, En) and Y* =/(..., e^, ei, . . . , where {e*)^^,^ is an iid copy 
of (£i)jgz, and h{9, J^n) = h{9, y„). It turns out that a{h, m) is closely related to a weighted 
distance between ¥„ and F*. Define 



Hm{y) 



d_ 

dy 



h{9,y) 



m{de). 



(29) 



Proposition 3. Let pm{a,b) = \ Hm'^{y)dy\. Then for n>0, 



*M|2 



(30) 



Hence we have a{h, m) < oo if 



j2\\pm{yn,Y:)\\ < oo. 



(31) 



n=0 



Proof Observe that E[h{9,Yn)\J^-i] = E[h{9,Y*)\J^_i] = E[h{9,Y*)\J^o]- Then we have 
Voh{9,Yn) = E[h{9,Yn) — h{9,Y*)\J^o]. By the Cauchy-Schwarz inequality, 

-1 2 



\Voh{9,Y^)r<\\h{9,Y^)-h{9,Y:)r<E 



dy 
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Let X{y) = Hm{y)- Again by the Cauchy-Schwarz inequality, 



\Voh{e,Yr,)\\MdO) < 



E 



d_ 

dy 



hi9,y) 



y* 



m{d9) 



E 



E 



y: 



y„ Ky) 

Y* -.2 

Hy)dy 



m{d6)dy x / \{y)dy 
Jy„ 



UY, 



So (jnni) follows. 

Let h{9,Yn) = f£{9\Yn). Then Hm{y) can be interpreted as a measure of "local depen- 
dence" of Xn+i on Yn at y. If Xn+i and Yn are independent, then fe{9\y) does not depend 
on y, and hence Hm = 0. Let 

A{y;5)= [ [fMy)~fe{d\y + s)]'m{de). 



Then ^/A{y■,6) is the weighted distance between the conditional densities of the con- 
ditional distributions = y] and [X„4.i|F„ = y + S]. Under suitable regularity 
conditions, lims^o6~'^A{y;6) = Hm{y). Therefore the function Hm quantifies the depen- 
dence of Xn+i on Yn. Intuitively, suggests short-range dependence in the sense that 
if we change Eq in Yn to 5q, then the cumulative impact of the corresponding changes in 
Yn, measured by the weighted distance pm, is finite. 

Remark 4. The random variable Y* can be viewed as a coupled version of Yn- The coupling 
method is popular. See Section 3.1 in Dedecker and Prieur (2003a) and Section 4 in their 
(2003b) for some recent contributions. <() 



6.1 Iterated random functions 

Many nonlinear time series models assume the form of iterated random functions [Elton 
(1990), Diaconis and Freedman (1999)]. Let 

Xn = R{Xn-l,en), (32) 

where e, e^, A; G Z, are iid random elements and R{-, ■) is a bivariate measurable function. 
For the process (jH^ . due to the Markovian property, (PH|) is satisfied with Yn = X„. The 
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existence of stationary distribution of ()32j) has been widely studied and there are many 
versions of sufficient conditions; see Diaconis and Freedman (1999), Meyn and Tweedie 
(1994), Steinsaltz (1999), Jarner and Tweedie (2001), Wu and Shao (2004) among others. 
Here we adopt the simple condition proposed by Diaconis and Freedman (1999). Let the 
Lipschitz constant 

_ \R{x,e)~R{x',e)\ 

J-'e — sup j — — . 

Assume that there exist a > and Xq such that 

E[L^ + |xo - i?(xo,e)r] < oo and E[log(L^)] < 0. (33) 

Then there is a unique stationary distribution (Diaconis and Freedman, 1999). The lat- 
ter paper also gives a convergence rate of an arbitrary initial distribution towards the 
stationary distribution. Wu and Woodroofe (2000) pointed out that the simple sufficient 
condition (jHHj) also implies the geometric-moment contraction: There exist /3 > 0, r G (0, 1) 
and C < oo such that 

E[|F(. . . , £o, . . . , Sn) - F(. . . , 4. ^1. • • • . ^n) < Cv^ (34) 

holds for all n G N, where (efc)fcgZ is an iid copy of {£k)keZ- Hsing and Wu (2004) and 
Wu and Shao (2004) argued that (|34j) is a convenient condition to establish limit theo- 
rems. Dedecker and Prieur (2003a, b) discussed the relationship between (|34j) and some 
new dependence coefficients. Recently Douc et al (2004) considered subgeometric rates 
of convergence of Markov chains. In their paper they adopted total variational distance, 
while (j34j) involves the Euclidean distance. 

Bae and Levental (1995) considered empirical central limit theorems for stationary 
Markov chains. It seems that the conditions imposed in their paper are formidably re- 
strictive. Consider the Markov chain Xn+i = aXn + £n+i, where < |a| < 1 and 
Ei are iid standard normal random variables. Then the transition density is q{u\v) = 
(27r)^^/^ exp[— (m — avY/2] and the chain has the stationary distribution A^[0, (1 — a^)~^]. 
Let a{u) = (27r)~^/^(l — a^)^/^ exp[— n^(l — a^)/2] be the marginal density. It is easily seen 
that sup^„g]g[g(u|f )/a(n)] = oo. Conditions (1.1) and (1.2) in Bae and Levental (1995) re- 
quire that this quantity is finite. For this process, by TheoremEl {y/n{Pn — P)g : g E G-y,^} 
converges weakly to a tight Gaussian process for any 7 > and /i < 1. 
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6.1.1 AR models with ARCH Errors 



Autoregressive models with conditional heteroscedasticity (ARCH) have received consid- 
erable attention over the last two decades. For recent work see Berkes and Horvath (2004) 
and Straumann and Mikosch (2003), where some statistical inference problems of such 
models are considered. Here we consider the model 



= aA„_i + £„y'a2 + h^Xl_^, (35) 

where {£i)i^z ii*^ random variables. In the case 6 = 0, then (j35|) is reduced to the 
classical AR(1) model. U b ^ 0, then the conditional variance of X„ given A„_i is not a 
constant and the model is said to be heteroscedastic. We assume without loss of generality 
that b = 1 since otherwise we can introduce a' = a/b and e[ = Si/b. Let R{x,e) = 
ax + ey/a^ + x^. Then < sup^ \dR{x, e)/dx\ < \a\ + Assume r = E[(|a| + \e\)^] < 1 
for some (3 > 0. Then (jSHI) holds and there is a unique stationary distribution. It is easily 
seen that ()34|) holds with this jS and r, and E(|Ai|'^) < oo. 

We now compute Hyj^{y). Denote by the density function of Si. Write u = u{9, y) = 
(6 — ay)/ + y"^. Then the conditional (transition) density fe{0\y) = (a^ + y"^)"^^"^ feiu) 
and 

d , X _ yfeju) afXu) {9 - ay)yf^{u) 

dy^'^^^' (a2 + y2)3/2 ^2 + ^2 + (a2 + y2)2 • ^ ! 

Assume that := ^^f'^{t)w^{dt) + ^^\f'^{t)\'^w^^2{,dt) < oo. Elementary calculations 
show that 

f^{uie,y))w,{de) = I pi J_ \{l + \v + ay\rdv 

Jm. \\/a? + y^j 

< C^{1 + \ay\y{a^ + y^f'^K^ + C^(a' + y^)^^^''^'^ k^, 
\au{0,y))?yo,{dd) < C,{l + \ay\)\a^ + y^f''K, + C,{a^ + y^)^'^^^'^K, 

and 

{6 - ayf\f,{u)\^w^{de) < C^{1 + \ay\y{a^ + y^fn^ + C^{c? + y2)(3+7)/2^_^_ 
Combining the preceding three inequalities, we have by that 



|/.(^l.)' 



w^{dy)<Ca,^^^{l + \y\)-'+\ (37) 
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Theorem 5. Let E[(|a| + l^l)'^] < 1 for some 7 > and 



fl{t)w.,+,{dt) 



|/;(t)|X+3(c?i) < 00. 



(3^ 



Then there exists G (0, 1) such that p^^^^iYn, Y*) = (9(0") and hence {^/n{Pn—P)g : g E 
Q-y^i} converges weakly to a tight Gaussian process. 

Proof. Let r = E[{\a\ + |5|)^]. By dSH), E(|F„, - < Cr" for some constant C. By §7^, 

H^^+Av) < C(l + \y\V^^- Let A = 7/2. If < A < 1, by LemmaEl 



il + \y\r-'dy 



0[E{\Yn-Y:\^^)]'/^ = 0(r"/2). 



If A > 1, let p = A/(A — 1). By Holder's inequality, 

E[(i + |r„|)2(^-i)|r„-y;|2] < {E[{i + \Yn\)''~'-'^^]y/^ x {E[\Yr,-Y:\'']y/' 

= 0[(r")i/^]. 



Hence 



il + \y\V~'dy 



Yr,. 



< m + \Yn\)'^' + ii + \Y:\)'~']\Y^-Y:\r 

< 4E[(1 + |F„|)2(^-i)|F„ - Y:\'] = 0[(r")i/^]. 



Let = max(r^/^, r^/*^^^)) < 1. Then = 0(0"'), and the theorem follows from 

(jnH), Proposition ini and (i) of Theorem El <!) 

Lemma 2. Lei < A < 1. T/ien /or a// u, v E R, \ /"(I + \y\)^^^dy\ < 2^-^\u - v\^/X. 

Proof. It suffices to consider two cases (i) u > v > and (ii) u > > v. For case (i), 

' 1 1 

(1 + \y\r-'dy = j[{l + u)' - (1 + v)'] < j{u - v)\ 

For the latter case, let t = u — v. Then 



(1 + \y\)'~'dy = ^[(1 + n)^ - 1 + (1 + |'.|)^ - 1] < ^[2(1 + 1/2)^ - 2] < 



)1-A 



A 



and the lemma follows. 



A' 
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6.2 Linear processes 

Let Xt = Xli^o ^i^t-i, where e^, A; G Z, are iid random variables with mean and finite and 
positive variance, and the coefficients (ai)j>o satisfy Xli^o*^? ^ Assume without loss of 
generality that oq = 1. Let and fe = be the distribution and density functions of Ek- 
Then the conditional density of Xn+i given JF„ is fe{x — Yn) and ()28p is satisfied, where 
Yn = Xn+i —En+i- Limit theorems for short and long-range dependent linear processes are 
presented in Sections 16.2.11 and 16.2.21 respectivelv. 



6.2.1 Short-memory linear processes 

Proposition 4. Let 7 > 0. Assume E(|efcp+'^) < 00 and 



K 



\f'^{u)\'^Wy{du) < 00. 



Then p^^(r„,r„*) =0(|a„+i|). 

Proof. Using the elementary inequality 1 + + < (1 + |t'|)(l + \y\), we have 



\f^(u-y)ni + \u\rdu 



< {l + \y\r / \av)\'w,{dv) = nil + \y\r 



and 



HwJy)dy 



\y\y^'dy 



< v^[(i + + (1 + - y:\ 



(39) 



Observe that Yn and Y* are identically distributed. Then 

P.,{Yr,,Y:) < 2v^||(i + |K|)^/2|y„-r;||| 

< 2^\an+i\[m + Kir^'eoW + 11(1 + \Yn\y/%\\. 

Since E(|efc|2+^) < 00, it is easily seen that || (1 + |F„|)^/Vo|| = C(l) and || (1 + = 
0(1). So the proposition follows. <C> 
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Remark 5. By Proposition HI if 7 = /i = 0, Ek has second moment and J]^[fe{'t)Ydt < 00, 
then ([T^ is equivalent to 

00 

^|a„|<oo, (40) 

n=l 

which is a well-known condition for a linear process to be short-range dependent. 

By Theorem ini Propositions El and EJ we have 
Theorem 6. Let 'j > and < /i < 1. Assume ^U^, E(|£:ip+'^+^) < 00 and 

[ \f',{u)\'w,+,{du) < oc. (41) 

Then {^/n{Pn — P)g : g G ^7,^} converges weakly to a tight Gaussian process. 
Corollary 2. Let 7 > 0. Assume E(|eip"'"'^) < 00, sup„/j(n) < 00 and 



\f^{u)\'w,+,{du) + / |/;(w)|^rfw<oo. 

m 

Then {i?„(s)(l + \s\y^^, sER} converges weakly to a tight Gaussian process. 

The corollary easily follows from Propositions El ID and Corollary ^ We omit the details 
of the proof. 

6.2.2 Long-memory linear processes 

Let Oo = 1 and a„ = n~^i{n), n > 1, where 1/2 < f3 < 1 and £ is a slowly varying 

function (Feller, 1971, p. 275). Then the covariances are not summable and we say that 

(Xt)^g2 is long-range dependent or long-memory. Let i^' be a measurable function such that 

-^^00(2;) := ^[K{Xi + x)] is in C^, the class of functions having up to pth order derivatives; 
let alp = n2-P(2/3-i)^2p^^) 



Sn{K;p) = J2 



1=1 



where Un,r= ^ J^a^^e^.j^. (42) 

0<jl<...<jr S = l 



Ho and Hsing (1996, 1997) initiated the study of such expansions, which are closely related 
to chaotic representations, Volterra processes (Doukhan, 2003) and Hermite expansions if 
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Xn is Gaussian. Here we consider the weak convergence of {Sn{K;p) : K G /C^} for 
the class /C^ = g*^^] recall ^ for the definition of g*^^^^. If (1 + p){2[3 - 1) < 1, then 
the hmiting distribution of {cr~p^^S'„(Jr; p) : K E JCy} is degenerate since it forms a line 
of multiples of the multiple Wiener-Ito integral (cf. Corollary 3 in Wu (2003a)). 

See Major (1981) for the definition of multiple Wiener-Ito integrals. To have a complete 
characterization, one needs to consider the case in which (1 +p){2(3 — 1) > 1. It turns out 
that, with the help of Proposition Q we are able to show that the limiting distribution is 
a non-degenerate Gaussian process with a y^-normalization. 
Recall Yn-i = X„ - Let S'^{y;p) = XlLi y), where 



p 



L^J^n, y) = F,iy - F„_i) - F{y) - 5^(-l)^F«(y)f/„,. (43) 



i=l 



Let Lp{Tn,y) = [lx„<y - F,{y - Y^-i)] + L^Tn^y), /°(J-„,y) = 9L0(^„, y)/9y, sl{y-p) 
Y^Li^pi^i^y) and Sn{y;p) = Y^Li^pi^i^v)- 

Theorem 7. Assume E(|£:i|^"*"''') < 00 for some 7 > 0, G C^~^^ and 

P+i 



E / l/f^(^)lX(^^)<oo. (44) 



(i) Ifp < {213 -ly^ - 1, then 



-^{SniK;p) : K G /C^} ^ {At+')(0) : K G /C,}Zp+i,^. (45) 

(a) If p > (2/5 — 1)~^ — 1, then {n~^^'^Sn{K;p) : K G /C^} converges weakly to a tight 
Gaussian process. 

Proof As in Wu (2003a), let = |a„_i|[|a„_i| + ^^'(4) +A^/'(2)] and e„,p = ELi ^Kv, 
where Anik) = Xli^n k*!'^- By Karamata's theorem (Feller, 1971, p. 281), An{k) = 
0{n\an\''), k > 2. (i) It follows from Theorem 1 and Corollary 3 in Wu (2003a). (ii) Since 
(l+p)(2/5-l)>l, 

00 00 

Y,On,p = 5^0^ + Klinaiy/' + Klinaly/'] < 00. (46) 

n=l n=l 
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By Lemma 9 in Wu (2003a), the condition ^^=o/r \f£^\^)\'^'^'yid,x) < oo together with 
E(|£:i|'^+''') < oo imphes that 

/ \\ViL{J^n,t)fw'y{dt) = 0{9lJ. (47) 
Jr 

Using the same argument therein, it can be shown that ()44j) entails the similar result 

/ \\P^fiJ'n,t)fw,{dt) = Oiel^). (48) 
Jr 

By Lemma 8 in Wu (2003a), E(|£i|'^^^(i+^'2)) < oo and (gll) imply that Sn{K;p) has the 
representation 

Sn{K-p) = - [ K'{x)Sn{x]p)dx = - V / K'{x)Lp{J^i,x)dx. (49) 

Jr T^j^ 

Combining and (pUj) . since K E JC-y, 



\\Vo [ K\y)L{J^^,y)dy\\ < J [ [K'{y)Yw^,idy) [ ||Po^(-^n, t)|IX(^*) 

are summable and by Lemma Q the finite-dimensional convergence follows. 

We now use the truncation argument as in the proofs of Propositions ^ and [21 to 
establish the tightness. Since VoL{J^n,t) = PoL^{J^n,t) for 7^ 0, (gZI) implies that 
/jg \\ViL°{J^ri,t)\\^Wy{dt) = 0{elp). By LemmaHand (gHI), we have 

1 



lim limsup— / \\S^{x;p)\\'^Wy{dx) = 0, 

n^oo n J\x\>r 



and 



limsup— / ||s„(x;p)|| Wy{dx) < oo. 
Applying the maximal inequality (j^H)) of Lemma El with yU = 0, 



(50) 



(51) 



sup|5°(x;p)p(l + |x|)^ <- / [\\S:{x;p)r + K{x;pmw,{dx) = 0{l). (52) 

Note that ^„,(x;p) = n[F„(x) - F„(x)] + S^{x;p). By (gHI), 

Sn{K-p) = - I K'{x)n[Fn{x) - F{x)]dx - [ K' {x)S^{x;p)dx 
Jm Jm 
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= ^/^M^{K)- f K\x)Sl{x]p)dx- I K'{x)S^{x;p)dx. 

J \x\>r J —r 

Since E(|Xi|'^+^) < oo, by (i) of Proposition [U the process {Mn{K) : K G K,^} is tight. 
For the second term, by the Cauchy-Schwarz inequahty, 

1 

n 



K' {x)S^{x] p)dx 

|a::|>r 



< / [K'{x)fw.,{dx)x- [ \\S'i{x;p)fw,{dx), 

\x\>r ^|x|>r 



which converges to by first letting n — >■ oo and then r — > oo. Now we deal with the last 
term. Using integration by parts, 

K'ix)S'^ix;p)dx = f K{x)sl{x-p)dx - K{r)S'^{r;p) + K{-r)S'^{-r;p) 



As in the proof of Proposition |21 (f^ implies that {n J^^ K{x)s^{x]p)dx, K E JC^} is 
tight. By (jSai, \\S^{±r;p)\\ = 0{^). Since \K{x)\^ < C^{1 + \x\)^+\ it is easily seen 
that {n~^/'^K{±r)S^{±r;p), K E /C^} is also tight. 

It is unclear how to generalize Theorem [7| to long-range dependent heavy-tailed pro- 
cesses, linear fields (non-causal) and other long-range dependent processes. The special 
case p = 1 is considered in many earlier results; see Giraitis and Surgailis (1999), Doukhan 
et al (2002) and Doukhan et al (2004). Limit theorems for heavy-tailed processes are pre- 
sented in Hsing (1999), Koul and Surgaihs (2001), Surgailis (2002), Wu (2003b) and Pipiras 
and Taqqu (2003). It seems that there is no easy way to establish pth order expansions 
for p > 2. 

7 Inequalities 

The inequalities presented in this section are of independent interest and they may have 
wider applicability. They are used in the proofs of the results in other sections. 

Lemma 3. Let H E AC. (i) Let /i < 1 and 7 G M. Then there exists C^^^ < 00 such that 
snv[H\x){l + |x|)^] < C^,^ I H\u)w^.^{du) + C^,^ I [H' {u)fw^+^{du). (53) 

(11) Let 7 > 0, /i = 1 and H{0) = 0. Then 

w_^^i{du) (54) 
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and 

I H\u)w^^^,{du) < \ I [H\u)f W-^+i{du). (55) 

(in) Let 7 > and H{±oo) = 0. Then 

sup[H\x){l + \x\y] < - [ [H'{u)fw^+i{du) (56) 
xpM. 7 JR 



and 



[ H\u)w^.^{du) < \ I [H'{u)fw^+^{du) 
Jm. 7 JR 



(57) 



Proof, (i) By Lemma 4 in Wu (2003a), for t G R and 5 > we have 

2 rt+S rt+5 

sup H\s)<- H\u)du + 26 [H'{u)]^du. (58) 

t<s<i+<5 Jt Jt 

We first consider the case /i < 1. Let a = 1/(1 — //). In (jSHI) let t = tn = and 

(5 = 5„ = (n + 1)° — n"", n G N and J„ = tn+i]. Since 5„ ~ an""^ as n — 00, 



sup[i72(x)(l + x)T] < 2sup(l + x)T 



j H\u)du + 6n f [H\u)]^du 



< C H\u)w^_^{du) + C / [H'{u)]^w^+^{du). (59) 

Jin Jin 

It is easily seen in view of fl58p that also holds for n = by choosing a suitable C. 
By summing over n = 0, 1, . . ., we obtain with sup^g]^ replaced by sup^>o- The 
other side x < similarly follows. 

If /i = 1, we let tn = 2*^, Sn = Wi - tn = tn and 4 = = 0, 1, . . .. The 

argument above similarly yields the desired inequality. 

(ii) Let s > 0. Since H{s) = H'{u)du, by the Cauchy-Schwarz inequality, 
follows from 

rs rs 

H^is) < / \H'{u)\^{l + uY~^'dux / {l + uy~^du 
Jo Jo 

< [ [H'{u)]^w..,+i{du) X 

JR 7 

Applying Theorem 1.14 in Opic and Kufner (1990, p. 13) with p = q = 2, the Hardy-type 

inequality ()55j) easily follows. The proof of (iii) is similar as (ii). <0 
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Lemma 4. Let m be a measure on R, A G R a measurable set and Tn{6) = Yl^=i h{9,J^i), 
where h is a measurable function. Then 



j=0 



(60) 



Proof. For J = 0, 1, . . . let T^M = T^tiAKO, :F{)\T,^,] and \] = \\Vohie, :Fj)fm{de), 
Xj > 0. By the orthogonality of E[h{e, J^i)\J^i^j] - ^[h{e, z = 1, 2, . . . , n, 

/ ||T„,,(e)-T„,,_i(e)fm(rf^) = n[ \\E[h{e, J^i)\J^i^,] - E[h{e, J^^)\J^_,]fm{de) 

J A J A 

= nj \\Vi^jh{e,J^i)fm{d9)=n\^j. 
Note that T„(6') = Tn^o{6). Let A = Yl^o ^j- Cauchy-Schwarz inequality, 



< A 
and (ISUj) follows. 

Lemma El easily follows from Burkholder's inequality. We omit the proof. 

Lemma 5. Let (-D.t)i<.t<n be L'^ (q > 1) martingale differences. Then 







\Di + ... + D„||f "("^''^ < [18g3/2(g - i)-i/2]min{9,2) J2 \\Di 



I min((ir,2) 
\q 



(61) 



i=l 



Lemma IHl below gives a simple maximal inequality. Weaker and special versions of it 
can be found in Doob (1953), Wu and Woodroofe (2004) and Billingsley (1968). It has the 
advantage that the dependence structure of {Zi} can be arbitrary. 

Lemma 6. Let q > 1 and Zi, 1 < i < 2'^, be random variables in C^, where d is a positive 
integer. Let Sn = Zi + . . . + Zn and S* = maxi<„ 15*4 1. Then 



\s;4U<Yl 



r=0 



2d-r 



7 ^ ||'S'2'-m ~ S2r(^rn-l)\\l 



m=l 



(62) 
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Proof. Let p = q/iq — I) and A = J2t=o -^r ^5 where 



Xr 



-1)11? 



m=l 



P+9 



For the positive integer k < 2'^, write its dyadic expansion k = 2'"^ + . . . + 2^-', where 



< rj < . . . < ri < d, and k{i) = T'^ 



T'K By Holder's inequahty, 



< 



i=l 

3 



i=l 



q/p 



i=l 



j 2" 



< A^/^5^A?,. J2 \S2nm-S2r. 



(m-l)l 



i=l m=l 
d 2''-'' 



< A'^/^5^A?5^|^2'-™-^2' 



(m-l)l , 



r=0 m=l 



which entails < A'?/?' ^^^^ A^A-p-« = A" and hence 



0. 



8 Proofs of Theorems [T] and |21 

Following ®, let G'„(s) = - and g„(s) = ^^/^[^^(a;) _ f{x)\. Then 

= Gn{s) + Qn{,s). Sections l8. II and IS^ deal with and Qn respectively. Theorems 
HI and El are proved in Sections 18.31 and lOl 



8.1 Analysis of G„ 

The main result is this section is Lemma IHl which concerns the weak convergence of Gr, 
Lemma 7. Let q >2. Then there is a constant Cq < 00 such that 



+Gq{y - X 



,9/2-1 



E[f^/\u\J^o)]d 



u 



(63) 
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holds for all n E N and all x < y, and 

\\Gn{x)\\l < C,mm[F{x),l-F{x)]. (64) 

Proof. Let di{s) = lx,<s -H'^x,<s\^i^i), di = di{y)-di{x) and A = df -E.{dl\Ti^i). Wu 
(2003a) deals with the special case in which Xj is a linear process and g = 4 [cf. Inequality 
(48) therein]. Let q' = q/2. By Burkholder's inequality (Chow and Teicher, 1978), 



\\G„iy) - Gn{x)\\l = n~'^/^E{\di + ... + d^n 
< C,n-^/2E[(rf? + . . . + 



i=l 



i=l 



(65) 



Since Di, i E Z, form stationary martingale differences, by Lemma we have 



< C,r2--(i'^'/2)2.'-i[||rf2||.; _^ ||E(d2|^o)|i:;] 

< C,n'""^(^'^'/')2^'||d?r;, 



(66) 



where we have applied Jensen's inequality ||E(d5|^o)||g/ < IMillq'- Notice that \di\ < 1, 

wdiwi < iMiii:; < 2^^'-Miii.<x.<,ii:; + iie(i.<x.<,i-^o)ii:;] ^ 2^'[^(^) - ^(^)]- (67) 

On the other hand, since q' > 1 and E((i^|jFo) < E{lx<Xi<y\^o) , we have by Holder's 
inequality with p' = q' / {q' — 1) that 

< n^'Mdiimi 



E 



J2^{d^\^^-l] 



1=1 



< E 



fs{u\J^o)du 



(y-xy'/^' / /|'(u|^o)c?« 



(68) 



Combining ((23), (ESI), (EH) and dHHl), we have 

To show we let in that (ij = di{x) = lxi<x — ^{'^Xi<x\^i~i)- Then 

< Cgn^/^rfif < C,n''/2F(x)[l - F(x)] 
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completes the proof. 







Lemma 8. Let q > 2. Then there exists a constant Cq < oo such that 



E 



sup \Gn{a + s)- Gn{a)\'^ 

0<s<b 



< C,d%"^'^"(^'''/^)"'?/2[F(a + 6) - F(a)] 

ra+b 

C,6^^/'"^[l + n^/'2'^(^^^/2)] / E[/|/2(n|Jo)]rfn (69) 

J a 



holds for all b > , a E R and n, d G N. In particular, for d = 1+ [(logn)/[(l — 2/q) log 2] J , 
we have 



E 



sup \Gn{a + s) - Gnia)]"^ 

0<s<b 



^g^max(l,q/4)-<j/2 J 
pa+b 

+ G^b"/'-' I E[f^/\u\J^o)]du. 



< C5(logn)%°^^'^^^''?/^^-'?/"[F(a + b) - F{a)] 

na+b 



(70) 



Proof Let h = b2''^, = Gn{a+jh)-Gn{a+{j-l)h), j = 1,...,2'^ and Sj = Zi + .. . + Zj. 
By Lemma El 



||'S'2'-m ~ 'S'2'-(m-l) \\q 



< + 2"m/i) - F{a + 2"(m - l)h)] 

na+2^mh 

+C,(2^/i)^/2-i / w.[fl/\u\T,)]du 

Ja-\ 



/a+2''(m-l)/i 



Hence 



2d- 



J2\\S2rm-S,r^rn^i)\\l < C,n--^(^'^/^)-''/2[F(a + 6)-F(a)] 

m=l 

+Gqirhy/'-' r\[ff'iu\j'o)]du. 

J a 



By Lemma ini 



n 



max{l,(jr/4)-g/2 



r=0 



[F(a + 6)-F(a)]}i/« 



)]du 



+j2{Gq{rhr/'~' r' E[f^/\u\j^o 

+ |c,(2'^/z)^/2-iy E[/,«/2(m|^o)]c^« 



1/9 



(71) 
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Recall Fnix) = n-^ ELi F,{x\J^i^i). Let Bj = v^[F„(a + jh) - F^a + (j - l)h)], j 
1, . . . , 2^^, and q' = q/2. Since < < 1, by Holder's inequality, 

9 



Therefore, 



ra+jh 

/ fe{u\Ti-i)du 

Ja+{j~l)h 



i=l 



a+jh 

lJa+{j-l)h 
a+jh 



fe{u\Ti-i)du 



I- a+jh 
J a+{j-l)h 



E 



max 5^ 



< E 



2d 



ra+b 
J a 



Observe that 



(72) 



Gn{a + h[s/h\) — max 5,- < Gn{a + s) < Gn{a + h\s/h + Ij) + max 5,. 
Hence ^ follows from (f7T| . (ff2|l and 

sup |G„(a + s) - Gn(a)| < sup \Gnia + h[s/h + 1\) - Gn{a)\ 

0<s<b 0<s<b 

+ sup \Gn{a + h[s/h\) — Gn{a)\ + ^maxBj 
< 2 max |G'„(a + /ij) — Gn{a) \ + 2 max S,- 
= 2Sld + 2 max Bj 

by noticing that h = 2^%. 

For = 1 + [(logn)/[(l - 2/q) log2]J, we have n^/22'^(i-'?/2) < i and hence ^ is an 



easy consequence of 



Lemma 9. Let ^ > and q> 2. Assume E[|Xi|'^ + log(l + |Xi|)] < oo and Then (i) 



E 



sup|G„(s)|''(l + |s|r 



0{l) 



(73) 
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and (ii) the process {G'„(s)(l + s G R} tight and it converges to a tight Gaussian 

process. 



Remark 6. In Lemma IHl the logarithm term log(l + \Xi\) is not needed if 7 > 0. 



Proof, (i) Without loss of generality we show that E[sup^>Q + = 0{1) since 

the case when s < follows similarly. Let q;„ = (logn)''n™''''(^'^/'^)"''/^. By (^^ and ((70)) of 
Lemma IHl with a = b = 2^^, 



sup |G„(s)-G„(2^)|^ 



^(1 + 2'=)^E 

k=l 

00 
k=l 

+C, V(l + 2'y{2y/'^' / E[f^/'{u\J^o)]du 

<C^,gan fiu){l + uydu + C^,g {l + uyu'^/^~^E[f^/^{u\J^o)]du 
J2 J2 



<C^,gan + C^,g = 0{l). 



(74) 



Let A^{d) = ELi(1 + 2'')^- Then A^dhg^v] + 1) < C{l + v)^ if 7 > and A^dhg^v] + 
1) < log2 f if 7 = 0. By (|64p of Lemma [7[ we have 



00 00 „oo 

J](l + 2^n|G„,(2^)l|^ < Y.^l + 2'rcJ f{v)dv 

k=l k=l '^■^'^ 

< Cq \ f{v)A^{}^og^v\ + \)dv < 00. 



(75) 



Thus dZni) follows from (fTIj) and (f7K|l . 

(ii) It is easily seen that the argument in (i) entails 



lim limsup] 



sup|G„(s)r(l + |s|)^ 

\s\>r 



(76) 



For 6 e (0, 1) let the interval 4 = 4(5) = [M, {k + 1)5]. Then 

sup |G„(s)(i + isir^'^ - Gnmi + \t\r/'^\ 

s,t£l-r,r], 0<s-t<5 

< sup |(l + |s|)^/«[G„,(s)-G„,(t)]| 

s,t£[-r,r], 0<s~t<5 
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+ sup |G„(t)[(l + |s|r/''-(l + |t|r/'']| 

s,t£[-r,r], 0<s~t<5 

Kil + ry/" sup \Gnis) - Gnit)\ + Cr,^^g6 sup \Gn{u)\ (77) 

s,tG[— r,r], 0<s—t<5 itS[— r,r] 

By (i), II sup^g]^ |G'rt(M)||| = 0{1). On the other hand, by Lemma |H1 



Lr/<5J+1 

k=-[r/5\~-l 

Lr-/5J+1 



sup |G„(s) - Gnik6)\ > e 



fc=-[r/5J-l ''^k J 

JR 



By dZD, j^¥.[fl''^{u\TQ)]du < oo. Hence 



hm sup : 



sup \Gn{s)-Gn{t)\>2t 

s,te[-r,r], G<s~t<S 



2-1 



which imphes the tightness of {G'n('S), — r < s < r} for fixed r. So (ii) follows from (fTBj) 
and (dZj). 



8.2 Analysis of Q„ 

It is relatively easier to handle Q„ since it is a differentiable function. The Hardy-type 
inequalities (cf Lemma Ej) are applicable. 



Lemma 10. Assume M^) . Then (i) 



sup 



|g„(.)p(i + |s|)2^/« 



LsGlt 



0{l) 



and (ii) the process {Qn{s){l + |s|)'''/'^, s G M} is tight. 
Proof. Let 7' = 27/g. (i) By (f^ of Lemma IHl 

sup[Q^(.)(l + |.|)^'] <-[ [Q'Mfw.^Ads). 

\s\>r 7 J\s\>r 
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By Lemma m 



sup|g„(s)|(i + |s|r'/2 

\s\>r 



< 



ttEW / \\Vofeie\j-,W^.'+iide). (79) 

/7 ^ V J\s\>r 

So dZHl) follows by letting r = in TH^ . 

(ii) The tightness follows from the similar argument as (ii) of Lemma El Let < 6 < 1. 
Then 

sup \Qnis){l + \s\r/''-Q4t){l + \t\r/'^\ 

s,tG[-r,r], 0<s-t<S 

< sup \{l + \s\y/''[Qn{s)-Qnm 

s,te[-r,r], 0<s-t<S 

+ sup |g„(t)[(l + |,|)7/._(l+|t|)7/.]| 

s,te[-r,r], 0<s~t<5 

<Cr,-y,q6 sup \Q'^{u)\ + Cr,^^q6 SUp \Qn{u)\ 
r,r] ?iS[— r-jr] 

Notice that || sup^gju |(5n(M)||| = By of Lemma El and Lemma El 



E 



sup|g;(.)p < Cy / ||g;(s)||V+y(rf^) + Cy / ||g'^(s)ft._i_,,(rf3) 

Then there exists Ci < oo such that for all n G N, 



E 



sup |g„(s)(l + Isl)-^/'? - Qn{t){l + \t\r/'^\'- 

!,tef-r,rl, 0<s-t<5 



Notice that the upper bound in ()79|) goes to as r ^ cxd. Hence (ii) obtains. 







8.3 Proof of Theorem [B 

Observe that {d/de)VoF,{e\J^j) = Vofe{0\J^j) and VoFe{e\J^j) = when 9 = ±oo. By §^ 
of Lemma |Sl 



^sup||Poi^,(^|^,)ll < Vg/(27)^(A,«^i4-2W,) < oo, 
1=0 " 



Hence by 



which by Lemma^entails the finite-dimensional convergence. Since Rn{s) = G'„(s)+g„(s), 
the tightness and follows from Lemmas IHl and ^1 <C> 
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8.4 Proof of Theorem ^ 

Note that (iogn)%"^^^(i'3/4)-''/2 = (logn)%i-«/2 ^ 0{6l^^^^). By ^ of Lemmall under 
the proposed condition we have uniformly in a that 



E 



sup \Gnia + s) - Gn{a)\'^ 

0<s<S„ 



< C,(log« n)n"^'^^(^''^/^)-'^/'[F(a + 5„) - F{a)] 



+ C^S^/'-'r'^/'-' J f{u)du 
< C5^J^~'[F{a + sl)-F{a)]. 

Here the constant C only depends on r, 7, q and E(|Xi|'''). Hence 



keZ 



sup \Gnik6n + S) - Gn{k5n) 



0<s<5 



< 



Let = (1 + k)S\. Since < 5 < 1, we have 



1 1 . 1+ i 

o < F < inf ^ , ', < sup ^ , ', < 1 + 5 < 2 



and 



sup \Gn{t + S) ~ Gn{t)\ < sup \G n{t + s) - G n{k6n] 

teIk{Sn),0<S<Sn te/fc(5„),0<S<<5„ 

+ sup \Gn{k6n) - Gn{t)\ 

teh{Sn),0<S<5„ 

< sup |G'„(M„, + m) - 

0<M<2<5„ 

+ sup \Gn{k6n + S) - Gn{k6n)\ 
0<s<S„ 

< 2 sup \Gnik6n + U) - Gn{k6n)\ 

0<M<2(5„ 



Therefore, 



E 



SUp(l + |t|)^ sup \Gn{t + S) - Gnm- 

IteR o<s<s„ 



fceZ 



sup {i + \t\y sup |G„(t + s)-a„(t)r 



<C5^(l + |M„|rE 
fceZ 



sup |G'„(/c(5„ + s) -^^(M^)!^ 

0<s<2<5„ 
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Note that -R„(s) = G'„(s) + Qn{s). Then flT^ follows if it holds with i?„ replaced by G„ 
and Qn respectively. The former is an easy consequence of the preceding inequality and 
Jensen's inequality. To show that (|T!^ holds with i?„ replaced by Qn, let 7' = 27/g. By 
(j53p of Lemma El and Lemma EJ 



E 



sup(l + |a;|)^'|Q;(x) 



00, 



which entails that 



E sup 

teM. 



:i + \t\y' sup \Qn{t + S) - Qn{t)\' 

\s\<S„ 



< sup 

tel 



[i + \t\y' sup mt + s)\' 

\s\<Sn 



<csiE\snp[ii + \x\r'mxm\ = ois', 



)1) 



and completes the proof. 







Remark 7. It is worthwhile to note that the modulus of continuity of G„ has the order 



5. 



1-2/q 



while that of Qn has a higher order 6n- 







9 Proof of Proposition [T] 

We shall adopt the truncation technique to deal with M„. For r > define the function 

9h\>r by {gl\.\>r)ix) = g{x)l\a,\>r and 

n 

Mn{gl\.\>r) = n-'/' J2{giXk)l\x,\>r - E[g{X,)l\x,i>r\:Fk-i]}- (80) 

k=l 

The function 5'l|.|<r and the process Mn{gl\.\<r) are similarly defined. Since Mn{g) = 
Mn{gl\.\>r) + Mn{gl\.\<r), the tightness of {Mn{g) : g G ^7,/^} follows from Lemmas [TT] 
and^l To see this, for any 6,7] > 0, by Lemma ^2 there exists r > such that 

limsupP* \ sup |M„(^?1|.|>,,)| > 7 i < 7- (81) 
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By Lemma 1121 there exists Ui, . . . ,Ui with I < oo such that l\-\<rQ'y,^i C U^^iUi and 

hmsupP* j max sup \Mn{g - > 7I < (82) 
Let Tj = {/i + pl|.|>r : h E Ui, p E ^7,/^}, 1 < « < Then ^^^^ C Ul^^Ti and we have 



hmsupP* < max sup \Mn{g — h)\ > 6 

< hmsupP* I max sup \Mn{gl\.\<r — ^lH<r)| > 7: 
+ hmsupP* I max sup \Mn{gl\-\>r — hl\.\yr)\ > ~| 

< hmsupP* I max sup \Mn{g — h)\ > - 

+ hmsupP* i sup |M„(^1|.|>,)| > ^ i < ^ 

in view of (jHl} and (|82|) . Thus by definition {M„((7) : G ^7,^} is tight since / < 00 and 
S and ?7 are arbitrarily chosen. 

The finite-dimensional convergence is a direct consequence of the martingale central 
limit theorem. The case when Q = 'Hn,5 can be similarly proved. ^ 

Lemma 11. (i) Assume ¥.{\Xi\"') < 00, 7 > 0. Then 

limlimsupE*i sup |M„(5(l|.|>t)|H = 0. (83) 
(a) Under conditions of (ii) of Proposition^^ 

lim lim sup E* <^ sup \Mn{gl\.\>t)\\ = 0. (84) 

Proof, (i) First assume /i < 1. We shall generalize the argument in Gine and Zinn (1986). 

Let a = (1 — fi)^^, gr{x) = g{x)lx>r", r E N, and the interval Ij = (j", (j + 1)°"]. Write 
Mn{gr) = An{g; r) + Bn{g; r), where 

00 n 

A4g;r) = n-'/'Y.Y.^l9{Xk)-g{jnnx,ei,-n{9{Xk)-g{jn)^^^^^^ 

j=r k=l 
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and 



j=r k=l 

Let Z,^n = -E(lx,e/,|^fc-i)]. Then < P(Xfc E I,) =: p,. By the 

Cauchy-Schwarz inequahty and of Lemma 01 

2 



E 



sup J2\g{r)z,j\ < e\ sup Y.9\r)i^ 



j=r 



cxj 



Since r) = E.'r and E[|Xi|i/")"7] = e(|Xi|t) < cx), 



hm sup hm sup E* 

r— ►oo n— >oo 



sup |5„(5(;r) 



< C^,^hmsup^(l = 0. 



< 



We now deal with An{g:,r). Clearly ()83|) follows if the preceding inequality also holds 
for An{g]r). To this end, let Sn{u) = ^^""^^^ ELi[l^fe6J(«) - H'^Xk£J{u)\^k~i)], where 
J{u) = {u, ([m^/"J + 1)"]. Since [giXk) - g{j°')]lxkeJiu) = Jj^ g'iu)lxke.J{u)du, we have 

oo n „ 

|^n(^7;r)| < n-'/^Y^Y. 3\u)[lx,eJiu)-nix,eJiu)\:Fk^i)]di 

j=r k=l "^^3 

\g'{u)\\Sn{u)\du. (86) 
By the Cauchy-Schwarz inequality, |A„((7;r)p < \Sn{u)\'^Wy-^{du) since g G G-y,^- So 

lim sup lim sup E* < sup |yl„((y'; r)p > < limsup / \\Sn{u)\\'^w^_fj_{du) 

r— >oo n— >oo I 36^7, p I r— >oo Jr" 

/■oo 

< lim sup a / P[r <Xi < (t + !)"](! + t")T-^r-^dt = 

in view of E[(|Xfc|i/°)°(^-^)+"-i] = E{\Xk\^) < oo. In the case that /i = 1, let Ij = (2^ 2^'+i] 
and J{u) = {u,2u]. It is easily seen that the above argument still works, 
(ii) Let g = n^^s- By (jHSD and (jSHI), if < r/ - 5 < 1, then we have 



E 



sup \Br,{g;r) 



< J](1+jTv^ 
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and 



E 



sup \An{g]r) 



< 



(1 + uY^/¥[\Xi\ e J{u)]du 
(1 + t;°)VlPK < l^il < ('« + 

/>oo 

< C„,5 / i;"VipK < l^il < + 



< 



which in view of fll7p approaches zero if r — > oo. The case that 7] — 6 = 1 can be similarly 
dealt with. ■<) 

Lemma 12. Lei Q = Q^/^^ or Tiri^s- Then for any r > 0, the process {Mn{g'^\-\<r) '■ 9 ^ Q} 
is tight. 

Proof. Consider first Q = G-y,^- Recall (j^Hj) for the definition of the essential supremum 
norm 4. Then d2{g) < \\g\\oo ■= sup^gR \g{x)\ and 

N{U, l|.|<r^7,^, < N{U, l\■\<rGy,^„ \\ " ||oo)- 

Let C^^f^ be the constant in (|53p and define the Sobolev class 



S = i h : [— r, r] 



sup < C^,^(l + r)^and /" \h'{x)\^dx < {1 + ry^^'l 

x£[—r,r] J —r J 



Then there exists a constant C = C{r,'~f,fi) such that for every e > 0, 

logiV(e,5,|| ■ lU) < -. 



^7) 



[cf. Birman and Solomjak (1967) or Theorem 2.7.1 in van der Vaart and Wellner (1996)]. 
For every g G 'i-\-\<rG'y,^i, it is easily seen by (j53|l that sup^g[_^,.] |5'(a;)|^ < C^,^(l + and 
Zr \9'{x)\'^dx < (1 + r)T-^. Hence 



N{U, l\.\<rG- 



7,^) II II oo. 



< N{u,S, 



and consequently 



J ^JhgN{u, l\.\<rG-y,fi,d2)du < J ^/\ogN{u,S, \\ ■ \\oc)du < oo. 
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Therefore the lemma follows from Theorem 3.3 in Dedecker and Louhichi (2002) [see also 
Section 4.2 therein]. The case that Q = Tir^^s can be similarly proved. <C> 
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